Appl. No. 10/017,504 
Amdt. dated October 10, 2006 



PATENT 



Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of Claims: 

1. (Currently amended) A method for clustering a plurality of items, each of 
the items including information, the method comprising: 

inputting a plurality of items, each of the items including information, into a 
clustering process; 

inputting an initial organization structure into the clustering process, the initial 
organization structure including one or more categories, at least one of the items being 
associated with at least one of the categories of the initial organization structure; 

processing, using at least processing hardware, the plurality of items in at least the 
clustering process, the clustering process being based in part upon the initial organization 
structure and in part upon the information in each of the items; 

automatically determining, using at least the processing hardware, a resulting 
organization structure based upon the processing of the plurality of items, the resulting 
organization structure comprising at least a portion of the initial organization structure and a 
modification to at least one of the categories of the initial organization structure such that 
an association of a first one of the items to at least one of the categories of the resulting 
organization structure is different from an association of the first one of the items to at least one 
of the categories of the initial organization structure; and 

storing the resulting organization structure in memory. 

2. (Currently amended) The method of claim 20 wherein the 
processing comprises determining a likeness level between a first item and a second item, the 
likeness level being defined such that the likeness level between two items is increased 
to the extent that both of the two items are similar to items associated with a same one or 
more of the categories of the initial organization structure. 

3. (Currently amended) The method of claim 2 wherein the determining the 
likeness level between the first item and the second item comprises: 

associating a first feature vector with the first item and a second feature vector 
with the second item, each feature vector representing information included in 
the corresponding item; 

adding a first additional feature to each of the first feature vector and the second 
feature vector, the first additional feature representing a first category of the initial 
organization structure, the first additional feature in the first feature vector corresponding to 
a degree to which the first item is similar to one or more items associated with 
the first category of the initial organization structure, the first additional feature in the second 
feature vector corresponding to a degree to which the second item is similar to one or more items 
associated with the first category of the initial organization structure; and 

calculating a degree of similarity of the first item and the second item including 
calculating a similarity measure between the first additional feature in the first feature 
vector and the second feature vector. 
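
For illustration only, and forming no part of the claims, the likeness level of claims 2 and 3 can be sketched as follows. The sketch assumes cosine similarity as the similarity measure and the mean similarity to a category's items as the "degree" stored in the additional feature; neither choice is stated in the claims, and the names cosine, category_feature, augment, and likeness are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def category_feature(item_vec, category_item_vecs):
    """Degree to which an item resembles the items associated with one category.

    Assumption: the degree is the mean cosine similarity to those items.
    """
    if not category_item_vecs:
        return 0.0
    total = sum(cosine(item_vec, c) for c in category_item_vecs)
    return total / len(category_item_vecs)


def augment(item_vec, category_item_vecs):
    """Append the additional (category) feature to the item's feature vector."""
    return list(item_vec) + [category_feature(item_vec, category_item_vecs)]


def likeness(first_vec, second_vec, category_item_vecs):
    """Similarity of the two items computed over the augmented feature vectors."""
    return cosine(augment(first_vec, category_item_vecs),
                  augment(second_vec, category_item_vecs))


# Two items with no shared base features, both resembling the same category item.
first, second = [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]
category_items = [[1.0, 1.0, 0.0]]
with_category = likeness(first, second, category_items)
without_category = likeness(first, second, [])
```

Under these assumptions, two items that share no base features still receive a positive likeness level when both resemble items associated with the same category, which is the behavior recited in claim 2.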

4. (Canceled) 

5. (Original) The method of claim 1 wherein the resulting organization 
structure relates to the initial organization structure based upon a category similarity. 

6. (Original) The method of claim 1 wherein the resulting organization 
structure relates to the initial organization structure based upon a similarity of hierarchy 
structure. 

7. (Original) The method of claim 1 wherein the item is a document, a 
product, a person, a DNA sequence, a purchase transaction, a financial record, or a species 
description. 

8. (Original) The method of claim 1 further comprising outputting the 
resulting organization structure on an output device. 

9. (Original) The method of claim 1 wherein the processing hardware uses at 
least a 500 MHz clock to efficiently run the clustering process. 

10. (Original) The method of claim 1 wherein the plurality of items includes 
at least 10,000 items. 

11. (Currently amended) A computer aided information organization device, 
the device including one or more computer memories, the one or more computer memories 
including: 

a first code directed to inputting at least 10,000 items in electronic form into a 
clustering process, each of the items including information; 

a second code directed to inputting an initial organization structure in electronic 
form into the clustering process, the initial organization structure including one or more 
categories, at least one of the items being associated with at least one of 
the categories of the initial organization structure; 

a third code directed to processing, using at least processing hardware, the 
plurality of items in at least the clustering process, the clustering process being based in part 
upon the initial organization structure and in part upon the information in each of the items; 

a fourth code directed to automatically determining, using at least the processing 
hardware, a resulting organization structure based upon the processing, the resulting organization 
structure comprising at least a portion of the initial organization structure and a modification to 
at least one of the categories of the initial organization structure 
such that an association of the first one of the items to at least one of the categories of the 
resulting organization structure is different from an association of the first one of the items to at 
least one of the categories of the initial organization structure; and 

a fifth code directed to storing the resulting organization structure in the one or 
more memories or another memory. 

12. (Currently amended) The device of claim 11 further comprising a sixth 
code directed to determining a likeness level between a first item and a second item, the likeness 
level being defined such that the likeness level between two items increases if both of the two 
items are similar to items associated with a same one or more of the categories in the initial 
organization structure. 

13. (Currently amended) The device of claim 12 wherein the sixth code 
directed to determining the likeness level between the first item and the second item comprising: 

a code directed to associating a first feature vector with the first item and a second 
feature vector with the second item, each feature vector relating to information 
included in the corresponding item; 

a code directed to extending the feature vector of each of the first item and the 
second item with an additional feature representing a category of the initial organization 
structure, the additional feature in each feature vector relating to a degree to which the respective 
item is similar to one or more items associated with the corresponding category of the 
initial organization structure; and 

a code directed to calculating a measure of similarity of the first item with the 
second item including calculating the similarity measure between the additional feature 
in the first feature vector and the second feature vector. 

14. (Canceled) 

15. (Original) The device of claim 11 further comprising a sixth code directed 
to outputting the resulting organization structure, the resulting organization structure including a 
plurality of categories. 

16. (Original) The device of claim 15 further comprising a seventh code 
directed to inputting additional items using the resulting organization structure. 

17. (Original) The device of claim 11 further comprising a sixth code directed 
to independently modifying the resulting organization structure using a graphical user interface. 

18. (Original) The device of claim 17 wherein the independently modifying is 
provided by a user coupled to the graphical user interface. 

19. (Currently amended) A computer implemented method for clustering a 
plurality of items, the method comprising: 

inputting a first hierarchy, the first hierarchy including a parent category and a 
first child category; 

inputting a plurality of items, each of the plurality of items including information, 
at least one of the items being associated with one of the categories in the first hierarchy; 

processing by the computer the plurality of items in a clustering process based 
upon at least the first hierarchy and the information in each of the items; 

automatically determining by the computer a second hierarchy based upon the 
processing of the plurality of items in the clustering process, the second hierarchy including 
a portion of the first hierarchy and one or more additional categories coupled to the first 
hierarchy, at least one of the one or more additional categories being a second child category 
of the parent category of the first hierarchy; 

storing the second hierarchy in memory; and 

assigning each of the plurality of items to a category of the second hierarchy. 

20. (New) The method of claim 1 wherein the clustering process is biased 
toward the categories of the initial organization structure. 

21. (New) The method of claim 20 wherein the modification to at least one of 
the categories in the initial organization structure includes a modification to a first category such 
that the first item is associated with the first category of the resulting organization structure but is 
not associated with the first category of the initial organization structure. 

22. (New) The method of claim 20 wherein the modification to at least one of 
the categories in the initial organization structure includes a modification to a first category such 
that the first item is associated with the first category of the initial organization structure but is 
not associated with the first category of the resulting organization structure. 

23. (New) The method of claim 1 wherein the resulting organization structure 
includes at least one additional category coupled to the initial organization structure. 

24. (New) The method of claim 19 wherein the clustering process guarantees 
that the resulting organization structure is an extension of the initial organization structure. 

25. (New) The method of claim 24 wherein processing in the clustering 
process comprises: 

forming a tentative cluster from at least two of the items; 

determining whether the tentative cluster is consistent with an extension of the 
initial organization structure; and 

rejecting the tentative cluster in the event that the tentative cluster is inconsistent 
with an extension of the initial organization structure. 
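
For illustration only, and forming no part of the claims, the form, test, and reject steps of claim 25 can be sketched as follows. The claim does not define when a tentative cluster is "consistent with an extension"; this sketch assumes a flat initial structure and treats a cluster as inconsistent when it would combine items that the initial structure places in different categories. The names is_consistent and try_cluster are hypothetical.

```python
def is_consistent(tentative_cluster, initial_categories):
    """Assumed consistency test: the cluster's already-categorized items
    must all come from a single category of the initial structure."""
    touched = [name for name, members in initial_categories.items()
               if tentative_cluster & members]
    return len(touched) <= 1


def try_cluster(items_a, items_b, initial_categories):
    """Form a tentative cluster from two groups of items.

    Returns the merged cluster if it is consistent with the initial
    structure under the assumed test, or None if it is rejected.
    """
    tentative = set(items_a) | set(items_b)
    if not is_consistent(tentative, initial_categories):
        return None  # reject the tentative cluster
    return tentative


# Hypothetical initial structure with two categories; "kiwi" is uncategorized.
initial = {"fruit": {"apple", "pear"}, "tools": {"hammer"}}
accepted = try_cluster({"apple"}, {"kiwi"}, initial)
rejected = try_cluster({"apple"}, {"hammer"}, initial)
```

Here the uncategorized item joins an existing category's item, while the merge that would straddle two initial categories is rejected, so the initial structure is only ever extended.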

26. (New) The method of claim 19 wherein the first hierarchy includes at 
least a parent category and a child category and wherein the guided clustering process includes: 

interposing a new category between a parent category of the initial organization 
structure and a child category of the initial organization structure. 

27. (New) A method for clustering a plurality of items, each of the items 
including information, the method comprising: 

inputting a plurality of items, each of the items including information, into a 
clustering process; 

inputting an initial organization structure into the clustering process, the initial 
organization structure including one or more categories, wherein at least a first one of the items 
is not associated with any of the categories of the initial organization structure; 

processing, using at least processing hardware, the plurality of items in at least the 
clustering process, the clustering process being based in part upon the initial organization 
structure and in part upon the information in each of the items; 

automatically determining, using at least the processing hardware, a resulting 
organization structure based upon the processing of the plurality of items, the resulting 
organization structure comprising at least a portion of the initial organization structure and a 
modification to the initial organization structure such that the first one of the items is associated 
with at least one of the categories in the resulting organization structure; and 

storing the resulting organization structure in memory. 